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Abstract 



We show that the binary logarithm of the non-negative rank of a non-negative matrix 
is, up to small constants, equal to the minimum complexity of a randomized communi- 
■ cation protocol computing the matrix in expectation. We use this connection to prove 

. new conditional lower bounds on the sizes of extended formulations, in particular, for 

\ perfect matching polytopes. 

in : 1 Introduction 

o ■ 

Extended formulations are a powerful tool for minimizing linear or, more generally, convex 
functions over polyhedra (see, e.g., Ziegler 1120 II for background on polyhedra and poly- 
topes). Consider a polyhedron P in R d and a convex function cp : R d — > R, that has to be 
^ [ minimized over P. If a small size linear description of P is known, then minimizing cp over 
P can be done efficiently using an interior point algorithm, or the simplex algorithm if cp is 
linear and theoretical efficiency is not required. 

However, P can potentially have many facets. Or worse: it can be that no explicit com- 
plete linear description of P is known. This does not necessarily make the optimization 
problem at hand difficult. A fundamental result of Grotschel, Lovasz and Schrijver ||8]] states 
that if there exists an efficient algorithm solving the separation problem for P, then optimiz- 
ing over P can be done efficiently. However, this result uses the ellipsoid algorithm, which 
is useless practically. It is desirable to avoid using the ellipsoid algorithm. 

Now suppose that there exists a polyhedron Q in a higher dimensional space R e such 
that P is the image of Q under a linear projection n : R e — > R d . The polyhedron Q together 
with the projection n define an extended formulation, or extension of P. Minimizing cp over 
P amounts to minimizing cp o n over Q. If Q has few facets, then we can resort to an interior 
point algorithm or the simplex algorithm to solve the optimization problem. Of course, one 
should also take into account the size of the coefficients in the linear description of Q and in 
the matrix of rc, but this will not be the main focus here. 
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The success of extended formulations is due to the fact that a moderate increase in di- 
mension can result in a dramatic decrease in the number of facets. For instance, P can have 
exponentially many facets, while Q has only polynomially many We will see examples of 
this phenomenon later in this paper. For more examples, and background, see the recent 
survey by Conforti, Cornuejols and Zambelli [3]. 

In light of the above discussion, it is natural to define the size of an extension Q as the 
number of facets of Q, and the extension complexity of a polyhedron P as the minimum size 
of an extension of P. Following ||6]|, we denote this by xc(P). The extension complexity of a 
polyhedron is a far better measure of how "complex" a polyhedron is than, for instance, its 
number of facets or its number of vertices and extreme rays. 

Because we mainly consider polytopes, we assume from now on that P is bounded, that 
is, P is a polytope. This is not a major restriction. In this case, one may assume without 
loss of generality that Q is also polytope (see below). So consider a polytope P in R d with 
m facets and n vertices. Let hi, . . . , h m be m affine functions on R d such that h\ (x) ^ 0, . . . , 
h m (x) ^ are the facet-defining inequalities of P. Let also V\, . . . , v n denote the vertices of 
P. The slack matrix of P is the non-negative m x n matrix S = S(P) = (s z -,-) with su = hj(vj). 

A rank r non-negative factorization of a non-negative matrix M is an expression of M 
as product M = AB where A and B are non-negative matrices with r columns and r rows, 
respectively. The non-negative rank of M, denoted by rank + (M), is the minimum natural r 
such that M admits a rank r non-negative factorization @. Observe that the non-negative 
rank of M can also be defined as the minimum r G N such that M is the sum of r non- 
negative rank one matrices. 

In a seminal paper, Yannakakis HT9l proved, among other things, that the extension com- 
plexity of a polytope is precisely the non-negative rank of its slack matrix (see also [6J). 

Theorem 1. For all polytopes P, 



Before going on, we sketch the proof of half of the theorem. Assuming P = {x G R . 
Ex ^ g}, consider a rank r non-negative factorization S(P) = FV of the slack matrix of 
P. Then it can be shown that Q := {(x,y) G R d+r : Ex + Fy = g,y ^ 0} is an extension 
of P. Notice that Q has r facets and r extra variables. Taking r = rank+ (S(P)) implies 
xc(P) ^ rank + (S(P)). Moreover, in this case Q is bounded, see Lemma [TTl in the appendix 
for a proof. 

In the work of Yannakakis IIT91 also appeared a connection between extended formu- 
lations and communication complexity (the book of Kushilevitz and Nisan |[TT| is a stan- 
dard reference on communication complexity). Every deterministic communication proto- 
col computing a non-negative matrix M (traditionally M is a binary matrix) yields a non- 
negative factorization of M, and thus an extended formulation. Indeed, such a protocol 
defines a partition of the matrix into sub-matrices whose entries are all equal. Notice that 
the rows and columns of such a "monochromatic" sub-matrix are not necessarily consecu- 
tive. Each sub-matrix yields a non-negative rank one matrix, and the sum of the resulting 
matrices is precisely M. The rank of this non-negative factorization of M is at most 2 C , where 
c is the complexity of the protocol. When M is the slack matrix of a polytope P, we obtain 
an extension of P. 

Notably, Yannakakis 11191 used this connection to obtain a subexponential size extended 
formulation for the stable set polytope of a n-vertex perfect graph from a deterministic com- 
munication protocol computing the corresponding slack matrix with polylogarithmic com- 



xc(P) = rank+ (S(P)) . 




1 This protocol was simplified by Hajnal, see the survey by Lovasz |[T3l . 
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The aim of this paper is to prove new results on extended formulations by tightening 
the connection between extended formulations, non-negative factorizations and communi- 
cation complexity. 

In Section |2j, we define the different polytopes considered here, and describe their facets 
and vertices. 

In Section |3l we discuss deterministic and randomized communication protocols and 
define what it means for a randomized communication protocols with private randomness 
to compute a given non-negative matrix M in expectation. 

Then we prove in Sections [4] and [5] that the minimum complexity of a randomized proto- 
col computing M in expectation is, up to small additive constants, the binary logarithm of 
the non-negative rank of M. This is done in two parts. Let c denote the minimum complex- 
ity of a randomized protocol computing M in expectation, and let r := rank+(M). First, in 
Section HJ, we prove the inequality lgr ^ c. Second, in Section [5} we prove the converse in- 
equality c ^ lgr + 0(l). The two inequalities together imply lgr = c + 0(1). By Theorem [1} 
we obtain a new characterization of the extension complexity of polytopes. 

Finally, in Section [6| we use this characterization to prove new results on extended for- 
mulations of perfect matching polytopes, a prominent family of polytopes for which the 
extension complexity is unknown. Yannakakis IIT9B proved every symmetric extension that 
the perfect matching polytope of the complete graph K n has exponential size. Here, we show 
roughly that there is a tradeoff between the amount of randomness used by an extension of 
the perfect matching polytope K n , regarded as a randomized protocol, and the size of this 
extension. In particular, we prove that if the protocol detects non-zero entries of the slack 
matrix with constant probability, then the extension has exponential size. 

Kaibel, Pashkovich and Theis [91 showed that the restriction to only symmetric exten- 
sions is a rather strong one. They showed that there exists polytopes with no polynomial 
size symmetric extension but that admit polynomial size extensions when the symmetry 
restriction is dropped. At the end of the section, we observe a similar phenomenon with 
our conditional lower bound. We show that every extension of the spanning tree polytope 
of K n that, seen as a randomized protocol, detects non-zero entries of the slack matrix with 
constant probability has exponential size, even though an 0(n 3 ) size extension exists. 

Finally we conclude this paper with a discussion of some related problems, conjectures, 
and future work. 



2 Polytopes relevant to this work 

Now we describe briefly various families of polytopes relevant to this paper. For a more 
detailed description of these polytopes, we refer the reader to [T7|. 

Let J be a finite ground set. The incidence vector of a subset / C I is the vector £ ^ 
defined as 

/ / l if i e J 
Xi { o if i £ J 

for i G I. For x G R 7 , we let x(f) := YUe] x i- 

Throughout this section, G = (V, E) denotes a (simple, undirected) graph. For a subset 
of vertices U C V, we denote the edges of the subgraph induced by U as E[U]. The cut 
defined by U, denoted as S(U), is the set of edges of G exactly one of whose endpoints is in 
U. That is, 

E[U] = {uv G E : u G U,v G U} , and 
S(U) = {uv EE:ueU,v £11} . 
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Later in this paper, we will often take G to be the complete graph K n with vertex set 
V(K n ) = [n] := {!,..., n} and edge set E(K n ) = {ij : i,j G [n],i ^ /}. 



2.1 Spanning Tree Polytope 

A spanning tree of G is tree T whose set of vertices and edges respectively satisfy V(T) = V 
and E(T) C E. The spanning tree polytope of G is the convex hull of the incidence vectors 
of the spanning trees of G, i.e., 

Pspanning tree(G) = conv{^ E(T) G IR E : T spanning tree of G} . 

Edmonds showed that this polytope admits the following linear description (See also 
CI, page 861) : 

x(E[U]) ^ |LT| — 1 for nonempty U C V , (1) 
x(E) = \V\-1, (2) 
x e ^ for e G E . (3) 

This follows, e.g., from the fact that the spanning tree polytope of G is the base polytope of 
the graphic matroid of G. 



2.2 Perfect Matching polytope 

A perfect matching of G is set of edges MCE such that every vertex of G is incident to 
exactly one edge in M. The perfect matching polytope of the graph G is the convex hull of 
the incidence vectors of the perfect matchings of G, i.e., 

Pperfect matching (G) = conv{^ M G R E : M perfect matching of G} . 

Edmonds (H showed that the perfect matching polytope of G is described by the following 
linear constraints (See also 11X611 , page 438): 

x(5(U)) ^ 1 for U C V with odd, (4) 
x(5(v)) = 1 foreach^Gy, (5) 
x e ^ for e G E . (6) 



2.3 Stable set polytope 

A stable set S (often also called an independent set) of G is a subset of the vertices such that 
no two of them are adjacent. A clique K of G is a subset of the vertices such that every two 
of them are adjacent. The stable set polytope STAB(G) of a graph G(V,E) is the convex hull 
of the incidence vectors of the stable sets in G, i.e., 

STAB(G) = conv{x s G R y : S stable set of G} . 

No complete linear description of the stable set polytope for arbitrary graphs is known. 
It is, however, known that the following inequalities are valid for STAB(G) for any graph G: 

x(K) < 1 for each clique K of G , (7) 
x v ^ for v G V . (8) 

Inequalities are called the clique inequalities. See Schrijver |fT7| for details. 

A graph G is called perfect if the chromatic number of every induced subgraph equals 
the size of the largest clique of that subgraph. It is known that G is perfect if and only if the 
above inequalities completely describe STAB(G). 
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3 Communication complexity 



Let X, Y and Z be arbitrary finite sets with Z C R + , and let / : X x Y — >■ Z be a function. 
Suppose that there are two players Alice and Bob who wish to compute f(x,y) for some 
inputs x G X and y G Y. Alice knows only x and Bob knows only y. They must therefore 
exchange information to be able to compute the value of f(x,y), even though each player 
possesses unlimited computational power. 

The communication is carried out as a protocol that is agreed on beforehand by Alice 
and Bob, on the sole basis of the function /. At each step of the protocol, one of the player 
has the token. He/ she sends a bit to the other, that depends only on his/her input and on 
previously sent bits. The transmitted bit determines which player has the token in the next 
step. This is repeated until the value of / on (x, y) is known by both players. The minimum 
number of bits exchanged between the players in the worst case to be able to evaluate / by 
any protocol is called the communication complexity of /. 

In this section we describe deterministic protocols briefly and then randomized proto- 
cols (with private random bits). In the literature, a randomized protocol is said to compute 
a function / if for all inputs (x,y) G X x Y the protocol outputs the correct value, namely 
f(x,y), with high probability. Here we consider a new notion of computation where the 
value output by the protocol on input (x,y) has to equal f(x,y) in expectation. For a thor- 
ough description of deterministic as well as randomized protocols (with the usual notion of 
computation) we refer the reader to |[TT|. 

3.1 Deterministic protocols 

A protocol is best viewed as a rooted binary tree where each internal node is marked either 
Alice or Bob. The leaves have values associated with them. An execution of the protocol 
on a particular input is a root-to-leaf path in the tree. At a node owned by Alice, following 
the path to the left subtree corresponds to Alice sending a zero to Bob and taking the right 
subtree corresponds to Alice sending a one to Bob; and similarly for nodes owned by Bob. 
In case the protocol is deterministic, to each input (x,y) G X x Y corresponds a unique path 
from the root to one of the leaves, and the value at that leaf is f(x,y). Thus none of the 
players uses any randomness to decide which bits to send to the other player. 

More formally, we define a deterministic protocol as a rooted binary tree with some extra 
information attached to its nodes. Each internal node has a type, which is either X or Y. To 
each node v of type X is attached a function p v : X — > {0, 1}; to each node v of type Y is 
attached a function q v : Y — >• {0, 1}; and to each leaf v is attached a number \ v G R+, called 
the value of that leaf. 

An execution of the protocol on input (x,y) G X x Y is a root-to-leaf path that starts 
at the root and descends to a leaf. At any internal node v of type X the execution follows 
the egde to the left child if p v (x) = and to the right child if p v {x) = 1. Similarly, at any 
internal node v of type Y, the execution follows the edge to the left child if q v (y) = and to 
the right child if q v (x) = 1. The value of the execution is the value of the leaf attained by the 
execution. 

A deterministic protocol is said to compute the function / if for each input pair (x, y) the 
value of the execution of the protocol is exactly f(x,y). 

The complexity of a protocol is the height of the corresponding tree. 

These formal definitions capture the informal ones given above. Observe that the nodes 
of type X are assigned to Alice, and those of type Y to Bob. Observe also that Alice and 
Bob have unlimited resources for performing their part of the computation. It is only the 
communication between the two players that is accounted for. 
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Given an ordering x\, ... ,x m of the elements of X, and yi, . . . , y n of the elements of Y, we 
can visualize the function / : X x Y — > Z as am x n non-negative matrix M = M(f) = (ntij) 
such that m-ij = f(xi,yj) for all G [ra] x [n]. 

Now consider a deterministic protocol computing /. Since the protocol is deterministic, 
each of its leaves v determines a subset of rows R = R v and columns C = C v such that any 
input (xi,yj), the execution of the protocol on {x^yj) ends at leaf v if and only if i e R and 
; G C, that is, G Rx C. On each of the inputs (xj,y,-) with (z,;) eRxC, the function / 
evaluates to same value, namely the value at leaf u. The set R x C is called a rectangle. 

When Z7 varies among the leaves of the protocol, the rectangles R v x C v form a partition 
of [m] x [n] . It is easy to see that such a partition can be used to write M as a sum of non- 
negative rank one matrices, one for each leaf. For each leaf v, define a m x n matrix M v 
whose entry in the z'th row and ;th column is given by f(xi,yj) if i G R v and / € C V/ and 
otherwise. Thus the support of M v is R v x C v . Each of these matrices has rank at most one 
and we have that M = YlmeL M v , where L denotes the set of leaves of the protocol. 

If M is the slack matrix of a polytope P, it follows from Theorem[T]that P has an extension 
of size at most |L| ^ 2 C , where c is the complexity of the protocol. This was first observed 
by Yannakakis ||T9| . He proved the existence of a n ( lo g n ) size extension for the stable set 
polytope of a n-vertex perfect graph by giving a 0(log 2 n) complexity deterministic protocol 
for computing its slack matrix. We illustrate this by a related example, that appears to be 
new. 

Example 1. A graph G is called claw free if no vertex has three pairwise non-adjacent neigh- 
bors. Even though the separation problem for STAB(G) for claw free graphs is polynomial- 
time solvable, no explicit description of all its facets is known (see, e.g., UTTIl , page 1216). 
Recently Galluccio et al. gave a complete description of the facets of claw free graphs with 
at least one stable set of size greater than three 0. Also, recall that for a perfect graph G the 
facets of STAB(G) are defined by inequalities © and ©, see Section |Z3l 

Let G be a claw-free, perfect graph with n vertices. We give a deterministic protocol that 
computes the slack matrix of the stable set polytope P := STAB(G) of G. 

Because G is perfect, the (non-trivial part of the) slack matrix of P has the following 
structure: it has one column per stable set S in G, and each one of its rows corresponds to 
a clique K in G. The entry for a pair (K, S) equals if K and S intersect (in which case they 
intersect in exactly one vertex) and 1 if K and S are disjoin1@. 

Consider the communication problem in which Alice is given a clique K of G, Bob is 
given a stable set S of G, and Alice and Bob together want to compute 1 — \K (1 S\. Alice 
starts and sends the name of any vertex u of its clique K to Bob. Then Bob sends the names 
of all the vertices of its stable set S that are in N(u) U {u} to Alice, where N(u) denotes 
the neighborhood of u in G. Finally, Alice can compute K n S because this intersection is 
contained in N(u) U {u} and Alice knows all vertices of S n (N(u) U {u}). She outputs 
1 — |K fl S|. Because G is claw-free, there are at most two vertices in S P\ (N(u) U {u}), thus 
at most 31gn + 0(1) bits are exchanged by Alice and Bob. It follows that there exists an 
extended formulation of STAB(G) of size 0(n 3 ). 

3.2 Randomized protocols 

Randomized protocols are similar to deterministic ones except the players are allowed to 
use random bits to decide what to send to the other player. As mentioned earlier, the no- 

2 This describes the rows of the slack matrix of P that correspond to the clique inequalities 0. The slack 
matrix of P has another, shorter set of rows that correspond to non-negativity inequalities lO, and that we may 
safely ignore (see also Corollary|4]below). 
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tion of computation "in expectation" that we define here differs from the usual notion of 
computation "with high probability". 

Let X and Y be finite sets. A randomized protocol with private random bits (or shortly, 
a randomized protocol) is a rooted binary tree with some extra information attached to the 
nodes. Each internal node has a type, which is either X or Y. To each node v of type X is 
attached a function p v : X — > [0, 1]; to each node v of type Y is attached a function q v : Y — > 
[0, 1]; and to each leaf v is attached a non-negative number X v G R + , called the value of that 
leaf. The functions p v and q v define transition probabilities. 

An execution of the protocol on input (x,y) G X x Y is a random root-to-leaf path that 
starts at the root and descends to the left child of an internal node v with probability p v (x) if 
v is of type X and q v (y) if v is of type Y, and to the right child of v with the complementary 
probability 1 — p v (x) if v is of type X and l—q v (y) if is of type Y. The vaiue of the execution 
is the value of leaf attained by the execution. 

For each fixed input (x, y) G (X, Y), the value of an execution on input (x, y) is a random 
variable. We say that the protocol computes a function f : X x Y ^ Z in expectation if the 
expectation of this random variable on each (x, y) £ X x Y is precisely /(x, y) . 

The complexity of a protocol is the height of the corresponding tree. 

As observed in Section [3TTT we can regard a function /:XxY4 IR + as a non-negative 
matrix M = M(f) with m = |X| rows and n — |Y| columns. Below, as is natural, we will 
not make a distinction between these two types of objects. 



4 Factorizations from protocols 

Theorem 2. If there exists a randomized protocol of complexity c computing a matrix M G R^ xy 
in expectation, then the non-negative rank of M is at most 2 C . 

Proof Each node v of the protocol has a corresponding traversal probability matrix P v G 
R^ xY such that, for all inputs (x,y) G X x Y, the entry P v (x,y) is the probability that an 
execution on input (x, y) goes through node v. We claim that P v is always a rank one matrix. 

We prove this by induction on the depth of a node, starting from the root. When v is the 
root, P v is an all-one matrix because all executions start at the root. Thus P v = 11 T is a rank 
one matrix in this case. 

Next, consider a node u of depth at least one and its parent v. Without loss of generality, 
we assume that v is of type X, that is, v is assigned to Alice. Assume that P v = pq T for some 
non-negative vectors p G R x and^ G K y . Then we have P u = p'q T where p'(x) = p(x)p v (x) 
for x G X in case u is the left child of v, and p'{x) = p(x)(l — p v (x)) for x G X in case u is 
the right child of v. This proves the claim. 

Finally, let L be the set of all leaves of the protocol and A v be the value at leaf v. Because 
the protocol computes M in expectation, for all inputs (x,y) G X x Y we have M(x,y) = 
Yuv&L ^vPv{x,y). Thus, M = YLveL ^vPv- Since the claim holds, each term in this last sum is a 
non-negative rank one matrix. The theorem follows. □ 

Recall that the polytopes considered in this paper have some facet-defining inequali- 
ties enforcing non-negativity of the variables along with other facet-defining inequalities. 
The next lemma and its corollary will allow us to ignore the rows corresponding to non- 
negativity inequalities, and focus on the non-trivial parts of the slack matrices considered 
here. 

Lemma 3. Let M be a non-negative matrix. Let R\,R2 be a partition of the rows of M defining 
partition of M into Mi and Mj_. If there exist randomized protocols computing M\ and Mi in 



7 



expectation with complexity c\ and ci respectively, then there exists a randomized protocol complexity 
computing M with complexity 1 + max{ci, ci}. 

Proof. When Alice gets a row of M she sends a bit to Bob to indicate whether her row lies in 
Ri or R2. Now that both Alice and Bob know whether they want to compute an entry in M\ 
or M2, they use the protocol for that particular submatrix. □ 

Corollary 4. Let P C be a polytope and let S'(P) denote the submatrix of S(P) obtained by 
deleting the rows corresponding to non-negativity inequalities. If there is a complexity c randomized 
protocol for computing S'(P) in expectation, then there is a complexity 1 + max{c, [lgrf] } random- 
ized protocol for computing S(P) in expectation. 

Proof. For computing the part of S(P) that is deleted in S'(P), which corresponds to non- 
negativity inequalities, we use the obvious protocol where Alice sends her row number to 
Bob and Bob computes the slack. Since at most d facets of P are defined by non-negativity 
inequalities, this protocol has complexity flgd]. The corollary thus follows from Lemma [3] 

□ 

For the protocols constructed here, we will always have c ^ [lg d] . Because of Corol- 
lary |H we can thus ignore the non-negativity inequalities without blowing up the size of 
any extension by more than a factor of two. Moreover, in terms of lower bounds, it is always 
safe to ignore inequalities because the non-negative rank of a matrix cannot increase when 
rows are deleted. 

To conclude this section, we give two illustrative examples. The first one is a reinterpre- 
tation of a well-known 0(n 3 ) size extended formulation for the spanning tree polytopes due 



to Martin |14|. The second one concerns the perfect matching polytopes and is implicit in 
Kaibel, Pashkovich and Theis 0. 

Example 2. Let P denote the spanning tree polytope of the complete graph K n , see Section lZll 
The (non-trivial part of the) slack matrix of P has one column per spanning tree T and one 
row per proper nonempty subset U of vertices. The slack of T with respect to the inequality 
that corresponds to U is the number of connected components of the subgraph of T induced 
by U (denoted by T[U] below) minus one. 

In terms of the corresponding communication problem, Alice has a proper nonempty set 
U and Bob a spanning tree T. Together, they wish to compute the slack of the pair (U,T). 
Alice sends the name of some vertex u in U. Then Bob picks an edge e of T uniformly at 
random and sends to Alice the endpoints v and w of e as an ordered pair of vertices (v,w), 
where the order is chosen in such a way that w is on the unique path from v to u in the tree. 
That is, she makes sure that the directed edge (v, w) "points" towards the root u. Then Alice 
checks that v £ U and w £ U, in which case she outputs n — 1; otherwise she outputs 0. 

The resulting randomized protocol is clearly of complexity 31gn + 0(1). Moreover, it 
computes the slack matrix in expectation because for each connected component of T[U] 
distinct from that which contains u, there is exactly directed edge (v, w) that will lead Alice 
to output a non-zero value, see Figure [TJ for an illustration. Since she outputs (n — 1) in this 
case, the expected value of the protocol on pair (U, T) is (n — 1) • (k — l)/(n — 1) = k — 1, 
where k is the number of connected components of T[U]. 

The corresponding extended formulation has size 0(n 3 ). 

For the next example, we will need the fact that one can cover the complete graph K n 
with k = 0(2 M / 2 poly(n)) balanced complete bipartite graphs G\, . . . , G/ c in such a way that 
every perfect matching of K 11 is a perfect matching of at least one of the G,'s. See Lemma H2l 
and its proof in the appendix. 
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Figure 1: Illustration of the protocol in Example |2j The black vertices are those in U. The 
green directed edges are those for which Alice outputs a non-zero value. The number of 
such edges is the number of connected components of T[U] minus one. 

Example 3. Assume that n is even and let P denote the perfect matching polytope of the 
complete graph K n with vertex set [n], see Section IZ2l The (non- trivial part of the) slack 
matrix of P has one column per perfect matching M, and its rows correspond to odd sets 
17 C [n]. The entry for a pair (U, M) is \S(U) n M| - 1 (recall that S(U) denotes the set of 
edges that have one endpoint in U and the other endpoint in II, the complement of U). 

We describe a randomized protocol for computing the slack matrix in expectation, of 
complexity at most (1/2 + e)n, where e > can be made as small as desired by taking n 
large. First, Bob finds an (n/2)-subset X C [n] that is compatible with his matching M, and 
tells the name of this subset to Alice, see Lemma [12] Then Alice checks which of X and X 
contains the least number of vertices of her odd set U . Without loss of generality, assume it 
is X. She picks a vertex u of U n X uniformly at random and send its name to Bob. He replies 
by sending the name of u' , the mate of u in the matching M. Alice then checks whether u' is 
in U or not. If u' is not in U, then she outputs |U| — 1. Otherwise u' is in U, and she outputs 
\U\ — 1 — 2\U nX|. Telling the name of X can be done in at most n/2 + lg y^ + lglgn + 0(1) 
bits. The extra amount of communication is21gn + 0(l) bits. In total, at most (1/2 + s)n 
bits are exchanged, for n sufficiently large (e > can be chosen arbitrarily). 

Now, we check that the protocol correctly computes the slack matrix of the perfect match- 
ing polytope. Letting E[U] denote the edges of the complete graph with both endpoints in 
U, the expected value output by Alice is 

\E[U] n M| 



|unx| 




E[U] nM| 




irnx| 





\u\-iy >- + (\u\-i-2\unx\ 

\E(u) n M| 



irnx 



u -i-2 unx 



|(jnx| 



= |U| -2|£(U) nM| -l 
= \5(U)nM\-l. 

The resulting extension has size at most 2( 1 / 2+e ) M ^ (1.42)" (for e small enough, and n 
large enough), whereas the main result of Yannakakis Ifl9l gives a lower bound of ( n / 4 ) ^ 
(1.74) n for the size of any symmetric extension. 
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5 Protocols from factorizations 



Theorem 5. If the non-negative rank of matrix M £ R™ XM has a rank r non-negative factorization, 
then there exists a randomized protocol computing M in expectation, whose complexity is at most 
lgr + 0(l). 

Proof Let A £ R+ Xr and B £ R^ XM be non-negative matrices such that M = AB. Let A 
denote the maximum row sum of A. Thus, M = (A/ A)(AB). Let A denote the m x (r + 1) 
matrix obtained from A/ A by appending a column whose entries are chosen so that each 
row-sum of A is precisely 1. Thus A is row-stochastic. Let B denote the (r + 1) x n matrix 
obtained from AB by appending a zero row. Notice that M = AB. 

The protocol is as follows: Alice knows a row index i, and Bob knows a column index /'. 
Together they want to compute M(i,j) in expectation, by exchanging as few bits as possible. 
They proceed as follows: Alice selects a column index k £ [r + 1] according to the probabil- 
ities found in row i of matrix A, sends this index to Bob, and Bob outputs the entry of B in 
row k and column j. 

This randomized protocol computes the matrix M in expectation. Indeed, the expected 
value on input (i,f) is E£=l A(i,k)B(k,j) = M(i,j). Moreover, the number of bits exchanged 
is [lg(r + 1)] , thus the complexity of the protocol is at most lg r + 0(1). □ 

Although the above protocol is defined literally, it is quite easy to obtain a protocol tree 
for it. The tree is balanced. All of its internal nodes are assigned to Alice, except those in 
the last layer which are assigned to Bob. There is one such node v(k) for each possible index 
k £ [r + 1] . The node v = v(k) has two children which are both leaves: the left leaf has value 
max{ B(k, £) : £ £ [n] } and the right leaf 0. The transition probability q v {j) is defined in such 
a way that the expected value output is correct, that is, 

q v (j) ■ max{B(k,£) : £ £ [«]} = B(k,j) . 

Another property of the protocol is that it is one-way: one of the player (Bob in this case) 
does not send any bit to the other player. In some sense, the protocol can be regarded 
as being in "normal form". (Always putting a protocol in "normal form" does not seem 
desirable, for instance, when the protocol is deterministic.) 

6 New lower bound for perfect matching polytopes 

We have seen that every extension of a polytope P corresponds to a randomized protocol 
computing its slack matrix S(P) in expectation and vice-versa. Now we show in particular 
that for the perfect matching polytope if we restrict ourselves only to those extensions that 
can determine with a constant probabilty whether or not an entry in the slack matrix is zero, 
then every extension has an exponential size. We then observe that the same holds for the 
spanning tree polytope. 

6.1 A reduction from the set disjointness problem 

The set disjointness problem is the following communication problem: Alice and Bob each 
are given a subset of [n] . They wish to determine whether the two subsets intersect or not. 
In other words, Alice and Bob have to compute the set disjointness function DISJ defined 
by DISJ(A,B) = 1 if A and B are disjoint subsets of [n], and DISJ(A,B) = if A and B 
are non-disjoint subsets of [n] . It is known that any randomized protocol that computes the 
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disjointness function with high probability (that is, the probability that the value output by the 
protocol is bounded by a constant strictly less than 1) has Q(n) complexity see, e.g., Kushile- 
vitz and Nisan flTTl , Babai et. al [1J, Kalyanasundaram and Schnitger ||10|, and Razborov 
E5I. 



To each matrix M G R^ xy , we associate the following communication problem, that we 
call the support problem: Alice is given a row x of M and Bob a column y of M. They wish 
to determine whether M(x, y) = or M(x, y) > 0. In the first case, they output and in the 
second case they output 1. 

Lemma 6. There is a reduction from the set disjointness problem for subsets of [n] to the support 
problem for the slack matrix of the perfect matching polytope for perfect matchings of K e , where 
£ ^ 3n + 8, that uses O(l) extra communication. 

Proof. Let A and B be the sets respectively given to Alice and Bob. After sending 1 bit with 
Alice, Bob and Alice can make sure that both B and its complement [ft] — B contain an even 
number of elements. They do this by adding dummy elements to the initial ground set [w], 
without adding them to A. 

Let k ^ n + 2 denote the number of elements currently in the ground set, and let £ := 
3k + 2 ^ 3ft + 8. We define an odd set U and a perfect matching M as follows. First, we let 



LI 



{i : i e A} U {i + k : i e A} U {3k + 1} 



Second, M is obtained by adding matching edges to the partial matching {{z, i + k} : i £ 
[k] - B} U {{i + k,i + 2k} : i e B} U {{3k + 1,3k + 2}} in such a way that each of the 
extra edges matches two consecutive unmatched vertices both in {i : i G [k]} or both in 
{i + 2fc : i £ [fc]}. See Figure |2] for an example. 

It can be easily verified that A and B are disjoint if and only if the slack for (U, M) is zero. 
The theorem follows. □ 
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Figure 2: Constructing an odd set and a perfect matching from a set disjointness instance. 
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6.2 The new lower bound 



Theorem 7. Consider an extended formulation for the perfect matching poly tope of K n and a cor- 
responding randomized protocol computing the slack matrix of this polytope in expectation. If the 
probability that the protocol outputs a non-zero value, given a pair (U, M) with positive slack, is at 
least pin) ^> 1/n, then the protocol has complexity Q(np(n)) and the extended formulation has 
size 2 q ( w pM). 

Proof. Let c be the complexity of the randomized protocol computing the (non-trivial part of 
the) slack matrix of the perfect matching polytope of K n in expectation. From this protocol, 
we obtain a new randomized protocol for the corresponding support problem by [1/ p(n)~\ 
independent executions of the given protocol, and outputting 1 if at least one of the exe- 
cutions led to a non-zero value or otherwise. The new protocol is such that, for all pairs 
(U, M) with a positive slack, the probability of outputting a zero value is at most 

where e is the Euler's number. Thus, there is constant probability that the value returned 
by the algorithm is positive. This gives a protocol of complexity 0(c/p(n)) for the sup- 
port problem for the slack matrix of the perfect matching polytope. The theorem follows 
directly from Lemma [6] and from the fact that the set disjointness problem has randomized 
communication complexity Q(n). □ 

For instance, deterministic protocols for computing the slack matrix of the perfect match- 
ing polytope give rise to exponential size extended formulations. The same holds if p{n) is a 
positive constant. For p(n) = Q((lg 2 n) / n), the size of the extended formulation is n^S"). 



6.3 Implications for spanning tree polytopes 

Incidently, an analogous statement holds for the spanning tree polytope of K n as well, even 
though for this polytope an extended formulation of size 0(n 3 ) exists. To show this we first 
prove a lemma analogous to Lemma [H 

Lemma 8. There is a reduction from the set disjointness problem for subsets of [n] to the support 
problem for the slack matrix of the spanning tree polytope ofK e , where I — 2n + 1, that uses no extra 
communication. 

Proof. For computing the slack matrix of the spanning tree polytope, Alice and Bob are given 
a set of vertices U and a spanning tree T respectively. They have to compute the number 
of connected components in T[U] minus one in expectation. For computing the support of 
the slack matrix they have to decide whether T remains connected after throwing away the 
vertices that are not in U. 

Given an instance of the set disjointness problem with sets A, B C [n], Bob creates a 
spanning tree T on £ := 2n + 1 vertices as follows. For every i £ [n] add the edge {z, 2n + 1} 
to T. For every i £ B add the edge {n + i,i} to T and for every i £ [n] — B add the edge 
{n + i,2n + 1} to T. See Figure|3]for an example. 

Alice lets U := {n + i : i £ A} U {2n + 1}. As is easily seen, T[U] is connected iff 
A n B = 0. Indeed, if i £ A n B then n + i and 2n + 1 are in different connected components 
of T[U]. Moreover, if A n B = then T[U] is a star with 2n + 1 as center. Also, Alice and 
Bob do not need to communicate in order to construct the spanning tree T and the subset 
U. □ 
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Figure 3: The spanning tree T for B = {1,2,4} and n = 7. Black vertices are those of the 
form i or n + i where i G B. 

Therefore, we have the following theorem. 

Theorem 9. Consider an extended formulation for the spanning tree polytope of K n and a corre- 
sponding protocol computing the slack matrix of this polytope. If the probability that the protocol 
outputs a non-zero value, given a pair (U, T) with positive slack, is at least pin) ^> 1/n, then the 
protocol has complexity Cl(np(n)) and the extended formulation has size 2°( m p( w )). 

The proof is identical to that of Theorem [7] and so we omit it. 

7 Concluding remarks 

Given a perfect matching M and an odd set U as above there is always an edge in S(U) flM. 
But it is not clear if such an edge can be found using a protocol with sublinear communi- 
cation. Now we show that if such an edge can be found using few bits then the perfect 
matching polytope has an extension of small size. 

Theorem 10. Suppose Alice is given an odd set U C [n] and Bob is given a perfect matching M of 
K n . Furthermore, suppose that Bob knows an edge e G S(U) n M. Then, there exists a randomized 
protocol of complexity 2 lgn + O(l) that computes the slack for the pair (U, M) in expectation. 

Proof. The protocol works as follows. Bob picks an edge e' from M \ {e} and sends it to 
Alice. She outputs n — 2 = \M\ — 1 if e' G S(U) and zero otherwise. The expected value of 
the protocol is (n - 2) • (\S(U) R M\ - l)/(n - 2) = \S(U) n M| - 1, as required. Bob needs 
to send the endpoints of the edge e' to Alice and this requires 2 lg n + 0(1) bits. □ 

The theorem above implies that if an edge in S(U) n M can be computed using a proto- 
col requiring o(n) bits, then there exists an extended formulation for the perfect matching 
polytope of subexponential size. We leave it as an open question to settle the existence of 
such a protocol. 
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A Other proofs 

Lemma 11. Let P = {x G R d : Ex ^ g} be a polytope, let S(P) = FV be a rank r non-negative 
factorization of the slack matrix of P with r := rank + (S(P)), and let Q := {(x,y) G R d+r : 
Ex + Fy = g,y ^ 0}. Then Q is bounded if and only ifP is bounded. 

Proof. Clearly, if P is unbounded then Q must be unbounded too. Now assume that P 
is bounded. The polyhedron Q is unbounded if and only if its recession cone rec(Q) = 
{(x,y) G R d+r : Ex + Fy = 0, y ^ 0} contains some nonzero point. Since P is bounded 
and the image of Q under the projection (x,y) i— > x is P, we have x = for every point 
G rec(Q). Therefore, Q is unbounded if and only if the system Fy = 0,y ^ has 
a solution y ^ 0. But any such y represents as a non-trivial conical combination of the 
column vectors of F. Since F is non-negative, this is only possible if one of the columns of F 
is identically zero, which would contradict the minimality of r. □ 

Lemma 12. Let n be an even positive integer. There exists a collection ofk = 0(2 n/2 v / nlnn) 
subsets X\, Xk of size n/2 of [n] such that for every perfect matching M of K n at least one of 
the subsets X ; - is compatible with M, that is, all the edges of M have one end in X z - and the other in 
Xi = [n] \ X t . 

Proof. Consider the following set covering instance: the universe U is the set of all 1 «/'\n/2)\ 
matchings of K n and there is a set S = S(X) for each (n/2)-subset X of [n] that contains all 
the perfect matchings M that are compatible with X. See, e.g., Vazirani fll8l for a detailed 
description of the set cover problem and approximation algorithms for it. 

A feasible fractional solution of this set covering instance takes each set X to an extent of 
l/2 n / 2 . This gives a feasible fractional solution because each given perfect matching M is 
compatible with exactly 2 n ^ 2 subsets X. (By symmetry considerations, it is possible to argue 
that this solution is actually optimal.) The value of this fractional solution is 

1 / n \ 2 n/1 
¥^\n/2j ^ yfR' 

at least for n sufficiently large. Thus the feasible integer solution given by the greedy algo- 
rithm fl2l is of size at most 

( 1+ln 24W)9^ 0(2 " /2 ^ ,8 " ) ' 

□ 
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